DESCRIPTION
DIDEOXY DYE TERMINATORS

FIELD OF THE INVENTION

This invention relates to dye terminator nucleic
acid sequencing and reagents for such sequencing.

BACKGROUND OF THE INVENTION
The following is a discussion of the relevant art,
none of which is admitted to be prior art to the
appended claims.

Sequence reaction products must be labeled. This
can be done using labeled primers, labeled nucleotides
(usually radioactive dNTPs) or labeled ddNTP
terminators. The use of labeled terminators has the
advantage of leaving false-stops undetectable.

DNA sequence bands do not necessarily have uniform
intensities. It is useful to express band intensity
variability numerically. This can be done by reporting
the ratio of maximum to minimum intensity of nearby
bands (within a window of perhaps 40 bases) in a DNA
sequence or, with normalization and correction for
systematic "drift" in intensity, by reporting the root
mean square of band intensities (typically peak
heights) (Fuller, C.W., Comments 16(3):1-8, 1989). It is
advantageous to have uniformity of band intensity, as
sequence accuracy and read-length are improved with bands
of more uniform intensity.
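
The first of the two measures described above, the ratio of maximum to minimum intensity among nearby bands, can be sketched as follows. This is a minimal illustration only: the function name, the sliding-window treatment and the example peak heights are assumptions made for this sketch and are not taken from the cited reference.

```python
def max_min_ratio(intensities, window=40):
    """Largest max/min intensity ratio found in any run of `window`
    consecutive bands (the whole list is used if it is shorter)."""
    if not intensities:
        raise ValueError("need at least one band intensity")
    if min(intensities) <= 0:
        raise ValueError("band intensities must be positive")
    span = min(window, len(intensities))
    worst = 1.0
    for start in range(len(intensities) - span + 1):
        chunk = intensities[start:start + span]
        worst = max(worst, max(chunk) / min(chunk))
    return worst


# Hypothetical peak heights in arbitrary units (not data from this document).
example_peaks = [100.0, 80.0, 120.0, 40.0, 90.0, 200.0]
print(max_min_ratio(example_peaks, window=3))
```

A ratio close to 1 corresponds to uniform bands; the 10-fold and 20-fold figures quoted for earlier dye-terminator methods would correspond to ratios of about 10 and 20.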
For accurate reading, any given sequencing reaction
product must migrate through the electrophoresis gel
with a speed that depends only on its length. Products
which migrate faster or slower than normal for a given
length will result in sequence ambiguities or errors
known as "compressions".

Anomalous migration speed can be caused by
secondary structure of the DNA and is apparently the
cause of most "compression" artifacts seen in
radioactive-label (and other) sequencing experiments at
GC-rich regions. These can often be resolved by the use
of analogs of dGTP such as 7-deaza-dGTP or dITP.
Another compression-like artifact is observed when some
dye-labeled ddNTPs are used for sequencing. Several
examples of this can be seen in Lee, L.G., Connell,
C.R., Woo, S.L., Cheng, R.D., McArdle, B.F., Fuller,
C.W., Halloran, N.D., and Wilson, R.K., Nucleic Acids
Res., 20:2471-2483, 1992 (see figures 4g, 4h and 6h
using ddCTP labeled with tetramethylbodipy and TMR or
ddGTP labeled with bifluor). These compression-like
artifacts are produced even in sequences which are
compression-free when sequenced radioactively or with
dye-labeled primers. These artifacts can sometimes be
eliminated by substituting dITP for dGTP or alpha-thio
dNTPs for normal dNTPs (Lee, L.G. et al., Nucleic Acids
Res., 20:2471-2483, 1992; U.S. Patent No. 5,187,085).
Similar artifacts seen with the fluorescein dye-labeled
ddNTPs sold by Applied Biosystems for dye-terminator
sequencing with T7 DNA polymerase are resolved by
substituting alpha-thio dNTPs for normal dNTPs (Lee,
L.G. et al., Nucleic Acids Res., 20:2471-2483, 1992;
U.S. Patent No. 5,187,085).

Prober, J.M., Trainor, G.L., Dam, R.J., Hobbs,
F.W., Robertson, C.W., Zagursky, R.J., Cocuzza, A.J.,
Jensen, M.A. and Baumeister, K., Science 238:336-41
(1987) performed sequencing using terminators labeled
with substituted succinyl-fluoresceins with linkers of
10 atoms in length, together with dATP, dCTP, dTTP,
7-deaza-dGTP and AMV reverse transcriptase, and a
fluorescence-detecting instrument. From Fig. 6 of this
paper it is clear that overall band intensities varied by
more than 10-fold, far more than the best available
current methods with dye primers or radioactive labels.
Dideoxy NTP terminators that have the same basic
structure as the Prober et al. (1987) terminators, but
have four rhodamine dyes used in place of the succinyl
fluoresceins and linkers of 5 atoms in length, have been
used for sequencing with Taq polymerase. In order to
use these terminators, dITP is used in place of dGTP or
7-deaza-dGTP to eliminate severe "compression"
artifacts. This method has been practiced using cloned
Taq DNA polymerase (Bergot, WO 9105060; Parker, L.T.,
Deng, Q., Zakeri, H., Carlson, C., Nickerson, D.A., Kwok,
P.Y., Biotechniques 19(1):116-121, 1995) and with a
mutant of Taq polymerase (D49G, AmpliTaq CS) lacking
5'-3' exonuclease activity. However, band intensities vary
by as much as 20-fold, limiting the accuracy and
read-length possible with the method (Parker, L.T., Zakeri,
H., Deng, Q., Spurgeon, S., Kwok, P.Y., Nickerson, D.A.,
Biotechniques 21(4):694-699, 1996).

Lee, L.G., Connell, C.R., Woo, S.L., Cheng, R.D.,
McArdle, B.F., Fuller, C.W., Halloran, N.D. and Wilson,
R.K. (Nucleic Acids Res., 20:2471, 1992) describe
sequencing with a set of ddNTP terminators and T7 DNA
polymerase. All have fluorescein-type dyes attached to
the ddNTPs in essentially the same manner as the
rhodamine terminators used for Taq sequencing. These
are used with modified T7 DNA polymerase (Sequenase™
version 2.0) and alpha-thio dNTPs. The thio dNTPs are
used to resolve the "compression" artifacts, as dITP is
used for the Taq dye-terminator methods. With this
system, bands vary in intensity about 10-fold.

Wayne Barnes has published a protocol for
dye-terminator sequencing with FY modified polymerases and
Mn2+ (Scientech Corp., St. Louis, MO). Bands are more
uniform with this method, varying about 4.5-fold at most.

Fluorescein-12 ddNTPs that have a linker length of
12 atoms and Biotin-11 ddNTPs that have a linker length
of 11 atoms are available (Dupont NEN, Wilmington, DE).
These labeled ddNTPs are described as useful in
sequencing reactions.

ABI PRISM disclose dichlororhodamine dyes linked to
terminators by propargyl/ethylene oxide/amino ("EO")
linkers eight atoms in length for sequencing (Rosenblum,
B.B., Lee, L.G., Spurgeon, S.L., Khan, S.H., Menchen,
S.M., Heiner, C.R., and Chen, S.M., Nucleic Acids Res.,
25(22):4500-4504, 1997).

Cyanine dyes have been utilized in dye terminators
for sequencing (Lee et al., Nucleic Acids Res.,
20(10):2471, 1992).

SUMMARY OF THE INVENTION
The present invention provides novel dideoxy dye-labeled
terminators which are useful in a number of
biological processes, including providing uniform band
intensities and the resolution of dye-induced
compression artifacts in DNA sequencing. The dideoxy
dye-labeled terminators of the present invention are
particularly well suited for use with DNA polymerases
that are thermostable or which contain an altered dNMP
binding site (Tabor et al., U.S. Patent No. 5,614,365).
Use of the terminators of the present invention for
sequencing does not require the use of nucleotide
analogs such as dITP or alpha-thio nucleotides to
eliminate dye-induced compression artifacts. Applicant
has surprisingly found that there is a strong
correlation between the length of the link between the
dye molecule and the nucleotide and band uniformity, but
little correlation between the type of dye (or other
parameters) and band uniformity. Dye terminators with
linkers of 10 or more atoms (extended linkers) up to 25
atoms (10, 11, 12 ... 25) when used in sequencing
reactions produce bands in sequencing gels of
significantly improved uniformity compared with dye
terminators with linkers less than 10 atoms.

The dye terminators of the present invention with
extended linkers typically are provided in groups of 



02/25/2002, EAST Version: 1.03.0002 



WO 99/40223



PCT/US99/02104



6 

four (ATGC) with or without a thermostable DNA 
polymerase and are especially useful in a method of 
sequence analysis. 

In a first aspect, the invention features a kit for
DNA sequencing having a first, second, third and fourth
dye terminator molecule, each of the dye terminator
molecules having a dye molecule, a linker of at least 10
atoms in length and either ddATP, ddCTP, ddGTP or ddTTP
as a mono- or tri-phosphate, and a thermostable DNA
polymerase.

By "dye molecule" is meant any molecule that has a
detectable emission spectrum, including but not limited
to fluorescein, rhodamine, Texas Red, eosin, lissamine,
coumarin, cyanine, and derivatives of these molecules.
Dyes also include energy transfer dyes each comprising a
donor and an acceptor dye.

By "linker" is meant a chain of at least 10 atoms
comprising carbon, nitrogen, and oxygen which links the
dye molecule with the dideoxynucleotide. The chain may
also contain substituted carbon or sulfur. Linkage
typically occurs at the aromatic base moiety of the
nucleotide. The first two atoms of the linker attached
to the base are typically joined in a triple bond.

By "substituted carbon" is meant that one or more
hydrogens are replaced with a substitute group such as,
but not limited to, hydroxyl, cyano, alkoxy, oxygen,
sulfur, nitroxy, halogen, -N(CH3)2, amino, and -SH.

By "thermostable DNA polymerase" is meant a DNA
polymerase that has a half-life of greater than 5 minutes at
50°C. Such polymerases include, but are not limited to,
DNA polymerases encoded by Thermus aquaticus, Thermus
thermophilus, Thermus flavus, Thermococcus littoralis,
Pyrococcus furiosus, Thermotoga maritima, and Thermotoga
neapolitana.

In a preferred embodiment the thermostable DNA
polymerase has an altered dNMP binding site so as to
improve the incorporation of dideoxynucleotides relative
to the natural polymerase. A DNA polymerase with an
altered dNMP binding site does not discriminate
significantly between dideoxynucleotides and
deoxynucleotides. The chance of incorporating a
dideoxynucleotide is approximately the same as that of a
deoxynucleotide or at least 1/10 the efficiency of a
deoxynucleotide.

In a second aspect the invention features a
compound of formula (I)

A 



A is a cyanine dye of the structure 




wherein the curved lines represent carbon atoms
necessary for the formulation of cyanine dyes; X and Y
are selected from the group consisting of O, S, and
CH3-C-CH3; m is an integer selected from the group
consisting of 1, 2, 3, and 4; R1, R2, R3, R4, R5, R6,
and R7 are independently selected from the group
consisting of H, OH, CO2H, sulfonic acid or sulfonate
groups, esters, amides, ethers, alkyl or aryl groups,
and B; and one of R1, R2, R3, R4, R5, R6 or R7 is B.

B is a linker of at least 10 atoms in length
wherein the atoms are selected from the group consisting
of carbon, nitrogen, oxygen, substituted carbon and
sulfur and the linker is attached at one end to A and at
the other end to C.

C is a dideoxynucleotide selected from the group 
consisting of: 




and wherein the linker is covalently bonded to the
dideoxynucleotide at position 7 for the purines (ddG,
ddA) and at position 5 for the pyrimidines (ddT, ddC)
and wherein r is a mono- or tri-phosphate.

The term "sulfonic acid or sulfonate groups" refers
to SO3H groups or salts thereof.

The term "ester" refers to a chemical moiety with
formula -(R)n-COOR', where R and R' are independently
selected from the group consisting of saturated or
unsaturated alkyl and five-membered or six-membered aryl
or heteroaryl moieties and where n is 0 or 1.

The term "amide" refers to a chemical substituent
of formula -NHCOR, where R is selected from the group
consisting of hydrogen, alkyl, hydroxyl, and
five-membered or six-membered aryl or heteroaryl ring
moieties, where the ring is optionally substituted with
one or more substituents independently selected from the
group consisting of alkyl, halogen, trihalomethyl,
carboxylate, nitro, or ester.

The term "ether" refers to a chemical moiety with
formula R-O-R' where R and R' are independently selected
from the group consisting of saturated or unsaturated
alkyl and five-membered or six-membered aryl or
heteroaryl moieties and where n is 0 or 1.

The term "alkyl" refers to a straight-chain or
branched aliphatic hydrocarbon. The alkyl group is
preferably 1 to 10 carbons, more preferably a lower
alkyl of from 1 to 7 carbons, and most preferably 1 to 4
carbons. Typical alkyl groups include methyl, ethyl,
propyl, isopropyl, butyl, isobutyl, tertiary butyl,
pentyl, hexyl and the like. The alkyl group may be
substituted and some typical alkyl substituents include
hydroxyl, cyano, alkoxy, oxygen, sulfur, nitroxy,
halogen, -N(CH3)2, amino, and -SH.

The term "aryl" refers to an aromatic group which
has at least one ring having a conjugated pi electron
system and includes both carbocyclic aryl (e.g. phenyl)
and heterocyclic aryl groups (e.g. pyridine). The term
"carbocyclic" refers to a compound which contains one or
more covalently closed ring structures in which the
atoms forming the backbone of the ring are all carbon
atoms. The term thus distinguishes carbocyclic from
heterocyclic rings in which the ring backbone contains
at least one atom which is different from carbon. The
term "heteroaryl" refers to an aryl group which contains
at least one heterocyclic ring.

In a preferred embodiment the linker is selected
from the group consisting of:

-C≡C-CH2-NH-CO-(CH2)5-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)[illegible]-NH-SO2-,
-C≡C-CH2-NH-CO-(CH2)10-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)5-,
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)5-, and
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)10-NH-CO-.

In preferred embodiments the dideoxy dye
terminators are: a compound of the formula (II);
a compound of the formula (III);
a compound of the formula (V):




The Cy-5.5 ddGTP and ddATP compounds have a linker
of 10 atoms in length. The Cy-5.5 ddCTP and ddTTP
compounds have a linker of 17 atoms in length.

In a third aspect the invention features a
deoxyribonucleic acid sequence containing the compound
of formula I, II, III, IV or V.

In a preferred embodiment the invention features a
kit for DNA sequencing comprising compounds of formula
II, III, IV, and V.

In a further preferred embodiment the kit further
has a thermostable DNA polymerase; the thermostable DNA
polymerase has an altered dNMP binding site so as to
improve the incorporation of dideoxynucleotides relative
to the natural polymerase.

Applicant has surprisingly found that the one
parameter that most strongly correlates with band
uniformity is the length of the linker between the dye
and the ddNTP. Applicant has found that by extending
the linker length between the dye and the nucleotide for
any dye:ddNTP combination to at least 10 atoms,
band uniformity is substantially improved and there are
no dye-induced compression artifacts.

Thus, in a fourth aspect, the invention features a
method for determining the nucleotide base sequence of a
DNA molecule consisting of the steps of incubating a DNA
molecule annealed with a primer molecule able to
hybridize to the DNA molecule in a vessel containing a
thermostable DNA polymerase, a dye terminator with a
linker of at least 10 atoms between the dye and the
nucleotide and separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of the DNA molecule can be
determined.

In preferred embodiments, the dye terminator is a
compound of formula I, II, III, IV or V; the
thermostable DNA polymerase has an altered dNMP binding
site.




Other features and advantages of the invention will 
be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 
All articles, publications and patents cited in
this application are hereby incorporated by reference
in their entirety.

BRIEF DESCRIPTION OF THE FIGURES
Fig. 1 presents DNA sequence data generated using
M13mp18 containing a 115 bp Sau3AI fragment from lambda
inserted at the BamHI site and Cy5.5 ddGTP, ddATP, ddTTP,
and ddCTP dye terminators.

Fig. 2 is a graph of band intensity variability
(rms) vs linker length (atoms).

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The following Examples are provided for further
illustrating various aspects and embodiments of the
present invention and are in no way intended to be
limiting of the scope.

Example 1: Synthesis of dideoxy dye terminators

Cy 5.5 dideoxynucleoside triphosphates

Dye terminators labeled with Cy5.5 were prepared
from propargylaminodideoxynucleotides (Prober, J.M.,
Trainor, G.L., Dam, R.J., Hobbs, F.W., Robertson, C.W.,
Zagursky, R.J., Cocuzza, A.J., Jensen, M.A. and
Baumeister, K., Science 238:336-41 (1987); U.S. Patent
Nos. 5,242,796, 5,306,618, and 5,332,666) and "CyDye
Fluorolink Cy5.5 mono reactive dye" product PA25501
(Amersham Life Science) to produce compounds II, III,
IV, and V. In the case of ddG and ddA, the
propargylaminonucleotide was directly reacted with the
N-hydroxysuccinimidyl ester of the Cy5.5 dye. In the
case of ddC and ddT, a longer linker was constructed by
reacting the propargylaminonucleotide with the
N-hydroxysuccinimidyl ester of N-trifluoroacetyl-6-aminocaproic
acid followed by hydrolysis of the trifluoroacetyl group
in aqueous ammonia. The resulting
compound was then reacted with the N-hydroxysuccinimidyl
ester of the Cy5.5 dye to give the 17-atom linker
between the Cy 5.5 dye and the pyrimidine base.
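
The 10-atom and 17-atom linker lengths can be checked by counting chain atoms. The sketch below assumes that the directly coupled purine terminators carry the linker -C≡C-CH2-NH-CO-(CH2)5- and that the aminocaproic-extended pyrimidine terminators carry -C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)5-, both of which appear among the linker structures listed in this document. The assignment of those structures to the Cy5.5 compounds, and the segment-list representation, are assumptions of this sketch.

```python
# Each segment is (label, number of chain atoms it contributes).
# Atoms that branch off the chain (carbonyl oxygens, hydrogens) are not counted.
SHORT_LINKER = [("C≡C", 2), ("CH2", 1), ("NH", 1), ("CO", 1), ("(CH2)5", 5)]
LONG_LINKER = SHORT_LINKER + [("NH", 1), ("CO", 1), ("(CH2)5", 5)]


def linker_length(segments):
    """Number of atoms along the chain between the base ring and the dye ring."""
    return sum(count for _label, count in segments)


print(linker_length(SHORT_LINKER), linker_length(LONG_LINKER))
```

Under these assumptions the 6-aminocaproic acid spacer adds seven chain atoms (one nitrogen, one carbonyl carbon and five methylene carbons), which accounts for the difference between the two linker lengths.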

In addition to Cy 5.5 dyes, those who practice the
art would know how to identify and utilize other dyes,
including other cyanine dyes, with the appropriate
optical properties. Also, the construction and
attachment of various linkers is well known in the art.
Suitable reagents for linker construction include one or
more compounds consisting of activated forms of
amino-protected alkyl or aryl amino acids such as compounds of
the formula R-NH-(CH2)n-CO2R' or R-NH-(CH2)nX(CH2)m-CO2R',
where R is an acid- or base-labile protecting group, R'
is a reactive ester or anhydride group, X is aryl, O, S,
or NH, and where n and m are 0-12. Other linkers
constructed by N- or O- or S-alkylation are also
suitable. The exact linker length, of at least 10
atoms, for a specific dye and dideoxynucleotide
combination can be determined empirically by monitoring
band uniformity in DNA sequencing as described (see
Example 3).
Example 2: Dye terminator cycle sequencing

DNA cycle sequencing was carried out using Thermo
Sequenase™ DNA polymerase (Amersham, Cleveland, OH) and
Cy5.5 dideoxy dye terminators using the following cycle
sequencing protocol:

1. A master mix was prepared consisting of the
following:

Template DNA
10X Reaction buffer (see below)
Primer, 2 µM
Polymerase (see below)
H2O

5.0 µl
3.5 µl

15.5 µl

Total volume    27.0 µl

10X Reaction Buffer:
150 mM Tris-HCl, pH 9.5
35 mM MgCl2

Polymerase: Thermo Sequenase™ DNA polymerase,
10 U/µl, 0.0017 U/µl, Thermoplasma acidophilum inorganic
pyrophosphatase: 20 mM Tris-HCl, pH 8.5, 1 mM DTT, 0.1 mM
EDTA, 0.5% Tween-20, 0.5% Nonidet P-40 and 50% glycerol.

2. Four microcentrifuge tubes were labeled and 2 µl of
Cy5.5 labeled ddG, ddA, ddT or ddC solution was added to
the corresponding tube.

25:1 ddG Mix, 300 µM each of dGTP, dATP, dTTP & dCTP,
12 µM Cy5.5 ddGTP

25:1 ddA Mix, 300 µM each of dGTP, dATP, dTTP & dCTP,
12 µM Cy5.5 ddATP

25:1 ddT Mix, 300 µM each of dGTP, dATP, dTTP & dCTP,
12 µM Cy5.5 ddTTP

25:1 ddC Mix, 300 µM each of dGTP, dATP, dTTP & dCTP,
12 µM Cy5.5 ddCTP

3. Six µl of the master mix (from step 1) was
aliquoted to each of the 4 tubes from step 2 above.
Cycling was carried out as follows: 95°C (30 sec),
45-55°C (30 sec) and 72°C (60 sec) for 35 cycles then
incubate at 72°C 5-7 minutes.
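
The arithmetic behind steps 2 and 3 can be sketched as follows: the final nucleotide concentrations after 2 µl of a termination mix is combined with 6 µl of master mix, and the total programmed hold time of the cycling profile. The function names are illustrative only, and the sketch assumes that the master mix contributes no nucleotides and that ramp times between temperatures are ignored.

```python
def final_concentrations(mix_ul, master_ul, dntp_um, ddntp_um):
    """Dilute a termination mix into the master mix; concentrations in µM."""
    total_ul = mix_ul + master_ul
    dilution = mix_ul / total_ul
    return {
        "total_volume_ul": total_ul,
        "dntp_um": dntp_um * dilution,
        "ddntp_um": ddntp_um * dilution,
        "dntp_to_ddntp": dntp_um / ddntp_um,
    }


def cycling_hold_minutes(step_seconds, cycles, final_hold_min):
    """Programmed hold time only; temperature ramps are not included."""
    return sum(step_seconds) * cycles / 60 + final_hold_min


reaction = final_concentrations(mix_ul=2, master_ul=6, dntp_um=300, ddntp_um=12)
print(reaction)
print(cycling_hold_minutes((30, 30, 60), cycles=35, final_hold_min=5))
print(cycling_hold_minutes((30, 30, 60), cycles=35, final_hold_min=7))
```

The dNTP:ddNTP ratio is unchanged by dilution, which is why each termination mix can be described simply as a 25:1 mix.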

4. One µl of 8M ammonium acetate was added to each
tube. Then 27 µl (approximately 3 times the reaction
volume) of chilled 100% ethanol was added. The mixture
was mixed and placed on ice for 20 minutes to
precipitate the DNA.

5. The mixture was centrifuged in a microcentrifuge
(~12,000 rpm) for 20-30 minutes at either room
temperature or 4°C. The supernatant was removed and
then 200 µl of 70% ethanol was added to wash the DNA
pellet.

6. The mixture was again centrifuged for 5 minutes,
the supernatant removed and the pellet dried (in a
vacuum centrifuge) for 2-3 minutes.

7. Each pellet was resuspended in 6 µl of formamide
loading dye (Amersham, Cleveland, OH) and vortexed
vigorously (10-20 sec) to ensure that all DNA was
dissolved. The mixture was briefly centrifuged to
collect the sample at the bottom of the tube.

8. Samples were heated to 70°C for 2-3 minutes to
denature the DNA, then placed on ice.

9. Then 1.5-2 µl of the volume was loaded onto a lane
of the sequencing gel, and the gel run on the MICRO Gene
Blaster instrument (VGI).

For this sequence, the template DNA was M13mp18
containing a 115 bp Sau3AI fragment from bacteriophage
lambda inserted at the BamHI site (product number US
70171, Amersham). The primer is the -40 Forward 23-mer
universal primer (5'-GTTTTCCCAGTCACGACGTTGTA-3') (SEQ.
ID. NO. 1). Results are shown in Figure 1.
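
As a quick consistency check on the primer quoted above, the following sketch counts its bases and computes its GC fraction. The helper name is hypothetical and nothing here is part of the described protocol.

```python
from collections import Counter

# -40 Forward universal primer as quoted in the text (SEQ ID NO: 1).
PRIMER = "GTTTTCCCAGTCACGACGTTGTA"


def base_composition(seq):
    """Return (per-base counts, GC fraction) for a DNA sequence."""
    counts = Counter(seq.upper())
    unexpected = set(counts) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    gc_fraction = (counts["G"] + counts["C"]) / len(seq)
    return counts, gc_fraction


counts, gc = base_composition(PRIMER)
print(len(PRIMER), dict(counts), round(gc, 3))
```

The length agrees with the description of the primer as a 23-mer.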

Example 3: Correlation of linker length and band
intensity variability

Sequencing reactions were carried out as described
in Example 2 with various dye molecules linked to
dideoxynucleotides with linkers of various lengths (see
Table 1). The labeled DNA products were then separated
on denaturing polyacrylamide gels and the labeled
products were detected by fluorescence. The intensity
of the bands is taken as the height of the peaks in a
graph of fluorescence (in arbitrary units) against time.
Typically, systematic variations in peak heights can be
seen in graphs of peak heights plotted sequentially.
These systematic variations in the peak heights can be
modeled by least-squares fitting to a second-order
polynomial function. Dividing the peak height for each
band by the value of the curve-fit polynomial function
yields a normalized band intensity for each peak.
Variation in these band intensities can be expressed as
the square root of the variance,
$\sqrt{(n\sum x^2 - (\sum x)^2)/n^2}$, of the
normalized peak heights, which can typically have values
between 0 and 1 with more variability represented by
higher numbers (Fuller, C.W., Comments 16(3):1-8, 1989).
This value is numerically equal to the root-mean-square
(RMS) value when 1.0 is subtracted from the normalized
peak heights. These values are reported in Table 1 and
graphed in Fig. 2. Variability of band intensities is
significantly reduced when linkers of 10 or more atoms
in length were used, resulting in sequence data that was
easier to interpret accurately.
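
The normalization and variability calculation described in this Example can be sketched as below. This is a minimal pure-Python illustration, not the software actually used: the quadratic fit is done through the normal equations, band index is used as the independent variable in place of time, and the function names and test data are assumptions of the sketch.

```python
import math


def fit_quadratic(xs, ys):
    """Least-squares fit of y = a + b*x + c*x**2 via the 3x3 normal equations."""
    s = [sum(x ** k for x in xs) for k in range(5)]            # sums of x^k
    t = [sum(y * x ** k for x, y in zip(xs, ys)) for k in range(3)]
    m = [[s[i + j] for j in range(3)] + [t[i]] for i in range(3)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(3):
            if row != col:
                factor = m[row][col] / m[col][col]
                m[row] = [a - factor * b for a, b in zip(m[row], m[col])]
    return [m[i][3] / m[i][i] for i in range(3)]


def band_variability(peak_heights):
    """Square root of the variance of drift-normalized peak heights."""
    n = len(peak_heights)
    if n < 3:
        raise ValueError("need at least three peaks for a quadratic fit")
    xs = list(range(n))
    a, b, c = fit_quadratic(xs, peak_heights)
    normalized = [h / (a + b * x + c * x * x) for x, h in zip(xs, peak_heights)]
    sum_x = sum(normalized)
    sum_x2 = sum(v * v for v in normalized)
    # sqrt((n*sum(x^2) - (sum x)^2) / n^2), guarded against rounding below zero.
    return math.sqrt(max(0.0, n * sum_x2 - sum_x ** 2) / n ** 2)


# Hypothetical traces: smooth drift only, and strongly alternating peaks.
smooth = [100 + 2 * x - 0.01 * x * x for x in range(50)]
alternating = [100.0, 50.0] * 20
print(band_variability(smooth), band_variability(alternating))
```

A trace whose peak heights follow only a smooth drift gives a value near zero, while a trace with alternating strong and weak bands gives a value well above the 0.10 to 0.37 range reported in Table 1 for linkers of 10 or more atoms.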



Table 1

No.  Base  Dye (a)    Linker       Band Uniformity
                      Length (b)   (rms)
 1   T     Coumarin                0.32
 2   G     Lissamine   5           0.77
 3   G     R110        5           0.34
 4   A     R6G         5           0.32
 5   G     R6G         5           0.57
 6   C     ROX         5           0.36
 7   T     TMR         5           0.47
 8   A     TxR         5           0.61
 9   C     Eosin       6           0.40
10   G     Cy3        10           0.24
11   A     Cy5        10           0.15
12   G     Cy5        10           0.21
13   A     Cy5.5      10           0.21
14   G     Cy5.5      10           0.20
15   A     Fl         12           0.16
16   C     Fl         12           0.20
17   G     Fl         12           0.17
18   T     Fl         12           0.18
19   A     R6G        12           0.13
20   T     R6G        12           0.25
21   A     ROX        12           0.21
22   T     ROX        12           0.16
23   C     TMR        12           0.26
24   G     TMR        12           0.29
25   T     TMR        12           0.37
26   A     TxR        16           0.32
27   C     TxR        16           0.24
28   G     TxR        16           0.22
29   T     TxR        16           0.24
30   A     Cy3-Cy5    17           0.11
31   C     Cy3-Cy5    17           0.16
32   G     Cy3-Cy5    17           0.22
33   T     Cy3-Cy5    17           0.11
34   C     Cy5        17           0.14
35   T     Cy5        17           0.10
36   C     Cy5.5      17           0.20
37   T     Cy5.5      17           0.18
38   A     Fl         17           0.16
39   C     Fl         17           0.24
40   G     Fl                      0.18
41   T     Fl         17           0.25
42   T     Fl         24           0.24

(a) Abbreviations for dyes: Fl, Carboxyfluorescein; R110, Rhodamine
110; R6G, Rhodamine 6G; ROX, Rhodamine X; TMR, tetramethylrhodamine;
TxR, Texas Red (Molecular Probes). The dyes Cy3, Cy3.5, Cy5 and
Cy5.5 were from Amersham Life Science, Cleveland, OH.

(b) Linker length is the number of atoms between the ring structure of
the nucleoside base (A, C, G or T) and the ring structure of the
dye.

Linker structures
-C≡C-CH2-NH-CO-
-C≡C-CH2-NH-SO2-
-C≡C-CH2-NH-CS-NH-
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-
-C≡C-CH2-NH-CO-(CH2)
-C≡C-CH2-NH-CO-(CH2)10-NH-CO-
-C≡C-CH2-NH-CO-(CH2)5-
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)5-
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)10-NH-CO-
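
The trend in Table 1 can be summarized by comparing the mean band variability for linkers shorter than 10 atoms with that for linkers of 10 atoms or more. The sketch below uses the (linker length, rms) pairs transcribed from Table 1; rows 1 and 40 are left out because their linker lengths are not legible in this copy, and the value for row 37 is read here as 0.18. The grouping threshold and variable names are choices made for this sketch.

```python
# (linker length in atoms, band variability rms) transcribed from Table 1,
# rows 2-39, 41 and 42 (rows 1 and 40 omitted: linker length not legible).
TABLE_1 = [
    (5, 0.77), (5, 0.34), (5, 0.32), (5, 0.57), (5, 0.36), (5, 0.47),
    (5, 0.61), (6, 0.40),
    (10, 0.24), (10, 0.15), (10, 0.21), (10, 0.21), (10, 0.20),
    (12, 0.16), (12, 0.20), (12, 0.17), (12, 0.18), (12, 0.13), (12, 0.25),
    (12, 0.21), (12, 0.16), (12, 0.26), (12, 0.29), (12, 0.37),
    (16, 0.32), (16, 0.24), (16, 0.22), (16, 0.24),
    (17, 0.11), (17, 0.16), (17, 0.22), (17, 0.11), (17, 0.14), (17, 0.10),
    (17, 0.20), (17, 0.18), (17, 0.16), (17, 0.24), (17, 0.25),
    (24, 0.24),
]


def mean(values):
    return sum(values) / len(values)


short = [rms for length, rms in TABLE_1 if length < 10]
extended = [rms for length, rms in TABLE_1 if length >= 10]
print(len(short), round(mean(short), 2))
print(len(extended), round(mean(extended), 2))
```

With these inputs the extended-linker group has a clearly lower mean variability than the short-linker group, in line with the conclusion drawn in Example 3.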
1. A kit for DNA sequencing comprising:

a first, second, third and fourth dye terminator
molecule, each of the dye terminator molecules comprising
a dye molecule, a linker of at least 10 atoms in length
and either ddATP, ddCTP, ddGTP or ddTTP as a mono- or
tri-phosphate, and a thermostable DNA polymerase.

2. The kit of claim 1, wherein said polymerase is
a thermostable DNA polymerase that has an altered dNMP
binding site so as to improve the incorporation of
dideoxynucleotides relative to the natural polymerase.

3. A compound of formula (I):

A



wherein A is a cyanine dye of the structure




and the curved lines represent carbon atoms necessary for
the formulation of cyanine dyes, X and Y are selected from
the group consisting of O, S, and
CH3-C-CH3, m is an integer selected from the group
consisting of 1, 2, 3, and 4, R1, R2, R3, R4, R5, R6 and
R7 are independently selected from the group consisting of
H, OH, CO2H, sulfonic acid or sulfonate groups, esters,
amides, ethers, alkyl or aryl groups and B, and one of R1,
R2, R3, R4, R5, R6 or R7 is B;

B is a linker of at least 10 atoms in length wherein
the atoms are selected from the group consisting of
carbon, nitrogen, oxygen, substituted carbon, and sulfur
and the linker is attached at one end to A and at the
other end to C; and

C is a dideoxynucleotide selected from the group
consisting of




wherein said linker is covalently bonded to said
dideoxynucleotide at position 7 for ddA and ddG and at
position 5 for ddC and ddT and wherein r is a mono- or
tri-phosphate.
4. The compound of claim 3, wherein said linker is
selected from the group consisting of

-C≡C-CH2-NH-CO-(CH2)5-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)[illegible]-NH-SO2-,
-C≡C-CH2-NH-CO-(CH2)10-NH-CO-,
-C≡C-CH2-NH-CO-(CH2)5-,
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)5-, and
-C≡C-CH2-NH-CO-(CH2)5-NH-CO-(CH2)10-NH-CO-.

5. A compound of the formula (II):

6. A compound of the formula (III):

7. A compound of the formula (IV):

9. A deoxyribonucleic acid sequence containing the
compound of formula I.

10. A deoxyribonucleic acid sequence containing the
compound of formula II, III, IV, or V.

11. A kit for DNA sequencing comprising compounds of
formula II, III, IV, and V.
12. The kit of claim 11, further comprising a
thermostable DNA polymerase.

13. The kit of claim 12, wherein said polymerase is
a thermostable DNA polymerase that has an altered dNMP
binding site so as to improve the incorporation of
dideoxynucleotides relative to the natural polymerase.

14. Method for determining the nucleotide base
sequence of a DNA molecule comprising the steps of:

incubating a DNA molecule annealed with a primer
molecule able to hybridize to said DNA molecule in a
vessel containing a thermostable DNA polymerase, one
of a set of four dye terminators with a linker of at
least 10 atoms between the dye and the nucleotide and
separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of said DNA molecule can
be determined.

15. Method for determining the nucleotide base
sequence of a DNA molecule comprising the steps of:

incubating a DNA molecule annealed with a primer
molecule able to hybridize to said DNA molecule in a
vessel containing a thermostable DNA polymerase, a
compound of formula I and

separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of said DNA molecule can
be determined.

16. Method for determining the nucleotide base
sequence of a DNA molecule comprising the steps of:

incubating a DNA molecule annealed with a primer
molecule able to hybridize to said DNA molecule in a
vessel containing a thermostable DNA polymerase, a
compound of formula II, III, IV, or V and

separating DNA products of the incubating
reaction according to size whereby at least a part of
the nucleotide base sequence of said DNA molecule can
be determined.

17. The method of any of claims 14, 15, or 16
wherein said polymerase is a thermostable DNA polymerase
that has an altered dNMP binding site so as to improve the
incorporation of dideoxynucleotides relative to the
natural polymerase.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:
Kumar, Shiv
Nampalli, Satyam
McArdle, Bernard F.
Fuller, Carl W.

(ii) TITLE OF INVENTION: DIDEOXY DYE TERMINATORS

(iii) NUMBER OF SEQUENCES:

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Lyon & Lyon
(B) STREET: 633 West Fifth Street, Suite 4700
(C) CITY: Los Angeles
(D) STATE: California
(E) COUNTRY: U.S.A.
(F) ZIP: 90071-2066

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: 3.5" Diskette, 1.44 Mb storage
(B) COMPUTER: IBM Compatible
(C) OPERATING SYSTEM: IBM P.C. DOS 5.0
(D) SOFTWARE: FastSEQ for Windows 2.0

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: To Be Assigned
(B) FILING DATE: Herewith
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Warburg, Richard J.
(B) REGISTRATION NUMBER: 32,327
(C) REFERENCE/DOCKET NUMBER: 225/219

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (213) 489-1600
(B) TELEFAX: (213) 955-0440
(C) TELEX: 67-3510

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GTTTTCCCAG TCACGACGTT GTA



Other embodiments are within the following claims. 